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Description 

SYSTEM AND METHOD FOR DIAGNOSING 
FAULTS UTILIZING BASELINE MODELING 

TECHNIQUES 

Cross Reference to Related Applications 

[0001] The present application is a continuation-in-part of co- 
pending United States Patent Application No. 09/682,314, 
filed on August 17, 2001, the entirety of which is incorpo- 
rated by reference herein. 
Background of the Invention 

[0002] The present invention relates generally to systems and 
methods for improving the quality and productivity of a 
product or process and more particularly to baseline 
modeling of a product or process. 

[0003] Baseline modeling of a product or process generally pro- 
vides an understanding of the performance of an ideal 
product or process over time. An engine is one type of 
product or process that baseline modeling is suitable for 



use. Engine baseline modeling has a multitude of uses in- 
cluding, but not limited to, determining when an engine 
performs out of specification, predicting when an engine 
failure will occur, detecting abnormal conditions, deter- 
mining the quality of an engine and designing new en- 
gines. Typically, engine baseline models are developed 
from data gathered from thermodynamic cycle analyses 
and simulation. First, models of ideal values are created 
and indexed by variables such as altitude, temperature, 
power setting, and air speed. Once data from the normal 
operation of the engine is available, these models are up- 
dated by analyzing data corresponding to a particular 
model characteristic. An engineer then looks for data that 
are similar for the specified engine variables (e.g., alti- 
tude, temperature, power setting, air speed), groups the 
similar data, averages them for each variable and per- 
forms other operations as desired. The engineer then 
plots data for each of the variables. The plots provide in- 
terrelationship information between each of the engine 
variables, which the engineer uses to create tables of typi- 
cal operation parameters the baseline model. These tables 
of parameters are used as the basis of comparison for en- 
gine operation. Differences from the baseline model may 



indicate engine faults or deterioration trends. 

[0004] There are several problems associated with this type of 
engine baseline modeling. First, this type of engine base- 
line modeling is very labor intensive because the engineer 
has to review the data, find data that are similar, group 
and average the data, perform other desired operations 
on the data, plot the data and create tables. Another 
problem is that one engineer cannot readily reproduce an 
engine baseline model developed by another engineer be- 
cause this process is very individualized. It is helpful if 
one engineer can reproduce the engine baseline model 
generated by another engineer to validate quality of the 
baseline. Another problem associated with this type of 
engine baseline model is that the resulting model does 
not provide a good picture of an engine operating outside 
normal conditions. Furthermore, this type of engine base- 
line modeling does not provide a measure of how good 
the developed model is. 

[0005] Accordingly, there is a need in the art of statistical model- 
ing for an automated approach to engine baseline model- 
ing that standardizes the process to improve reliability by 

minimizing human interventions. 
Brief Summary of the Invention 



[0006] The present invention overcomes the problems noted 

above, and provides additional advantages, by providing a 

system, method and computer readable medium that 

stores instructions for instructing a computer system, to 

perform engine baseline modeling. In one embodiment of 

the present invention, an engine service database contains 

engine data relating to time varying measurements for at 

least one engine. A preprocessor processes the engine 

data into a predetermined format. An engine baseline 

modeling component builds an engine baseline model 

from the preprocessed data, using a regression analysis, 

and wherein the output engine baseline model is adjusted 

for time varying effects on the measured/modeled data 

and parameters. Estimate trends for each parameter of the 

baseline are calculated to assist in fault determination. 
Brief Description of the Drawings 

[0007] The present invention can be understood more completely 
by reading the following Detailed Description of Preferred 
Embodiments, in conjunction with the accompanying 
drawings. 

[0008] FIG. 1 shows a schematic diagram of a general-purpose 

computer system in which a system for performing engine 
baseline modeling operates. 



[0009] FIG. 2 shows a top-level component architecture diagram 
of the engine baseline modeling system that operates on 
the computer system shown in FIG. 1. 

[0010] FIG. 3 shows a flow chart describing actions performed by 
the engine baseline modeling system shown in FIG. 2. 

[0011] FIG. 4 is a flow chart describing one alternative embodi- 
ment for actions performed by the engine baseline mod- 
eling system of FIG. 3. 

[0012] FIG. 5 is a flow chart illustrating one embodiment of a 
method for diagnosing system faults utilize the baseline 
models and engine parameter trends calculated in FIG. 4. 

[0013] FIG. 6 shows an architectural diagram of a system for im- 
plementing the engine baseline modeling system shown in 
FIG. 2. 

[0014] FIG. 7 is an exemplary graph of raw exhaust gas tempera- 
ture data over time. 

[0015] FIG. 8 is an exemplary graph of exhaust gas temperature 
difference from baseline model. 

[0016] FIG. 9 is an exemplary graph of principal component anal- 
ysis mapped data. 
Detailed Description of the Preferred Embodiments 

[0017] This disclosure describes a system, method and computer 
product for baseline modeling of a product or process 



such as an aircraft engine, however, the disclosure is ap- 
plicable to any type of product or process where it is de- 
sirable to model performance. FIG. 1 shows a schematic 
diagram of a general-purpose computer system in which 
a system for performing engine baseline modeling oper- 
ates. The computer system 10 generally comprises a pro- 
cessor 12, memory 14, input/output devices, and data 
pathways (e.g., buses) 16 connecting the processor, 
memory and input/output devices. The processor 12 ac- 
cepts instructions and data from memory 14 and per- 
forms various operations. The processor 12 includes an 
arithmetic logic unit (ALU) that performs arithmetic and 
logical operations and a control unit that extracts instruc- 
tions from memory 14 and decodes and executes them, 
calling on the ALU when necessary. The memory 14 gen- 
erally includes a random-access memory (RAM) and a 
read-only memory (ROM), however, there may be other 
types of memory such as programmable read-only mem- 
ory (PROM), erasable programmable read-only memory 
(EPROM) and electrically erasable programmable read-only 
memory (EEPROM). Also, memory 14 preferably contains 
an operating system, which executes on the processor 12. 
The operating system performs basic tasks that include 



recognizing input, sending output to output devices, 
keeping track of files and directories and controlling vari- 
ous peripheral devices. 
[0018] The input/output devices may comprise a keyboard 18 
and a mouse 20 that enter data and instructions into the 
computer system 10. Also, a display 22 may be used to 
allow a user to see what the computer has accomplished. 
Other output devices may include a printer, plotter, syn- 
thesizer and speakers. A communication device 24 such 
as a telephone or cable modem or a network card such as 
an Ethernet adapter, local area network (LAN) adapter, in- 
tegrated services digital network (ISDN) adapter, Digital 
Subscriber Line (DSL) adapter or wireless access card, en- 
ables the computer system 10 to access other computers 
and resources on a network such as a LAN, wireless LAN 
or wide area network (WAN). A mass storage device 26 
may be used to allow the computer system 10 to perma- 
nently retain large amounts of data. The mass storage de- 
vice may include all types of disk drives such as floppy 
disks, hard disks and optical disks, as well as tape drives 
that can read and write data onto a tape that could include 
digital audio tapes (DAT), digital linear tapes (DLT), or 
other magnetically coded media. The above-described 



computer system 10 can take the form of a hand-held 
digital computer, personal digital assistant computer, 
notebook computer, personal computer, workstation, 
mini-computer, mainframe computer or supercomputer. 

[0019] FIG. 2 shows a top-level component architecture diagram 
of an engine baseline modeling system 28 that operates 
on the computer system 10 shown in FIG. 1. Generally, the 
engine baseline modeling system 28 models the perfor- 
mance of an "ideal engine for a specified type aircraft en- 
gine. An engine baseline model built with the engine 
baseline modeling system 28 has a multitude of uses. An 
illustrative, but non-exhaustive list of potential uses for 
an engine baseline model built from the engine baseline 
modeling system 28 includes monitoring engine status, 
predicting future engine behavior, diagnosing engine 
faults, determining when an engine performs out of spec- 
ification, determining the quality of an engine and design- 
ing new systems for an engine. 

[0020] | n FIG. 2, there is an engine service database 30 that con- 
tains engine data for a variety of aircraft engines. The en- 
gine data comprises an assortment of engine performance 
information for each of the engines. Generally, engine 
performance information includes environmental data in 



which the engines operate such as altitude, air tempera- 
ture, air speed, engine loading, engine temperature and 
pressure. One of ordinary skill in the art will recognize 
that the engine service database 30 may comprises other 
engine performance information such as mach, fan speed, 
etc. In addition, the engine service database 30 may com- 
prise other data such as operational data. A non- 
exhaustive list of engine operational data stored in the 
engine service database 30 includes exhaust gas temper- 
atures (EGT), shaft speed between compressors and tur- 
bines (Nl and N2), pressure combustion (Pa) and fuel flow 
(WF). One of ordinary skill in the art will recognize that 
other engine operational data may include engine bleed 
settings, vibration readings, and control mechanism set- 
tings. Also, the engine service database 30 may comprise 
other data such as aircraft operating and settings data 
(e.g., bleed settings). 
[0021] one of ordinary skill in the art will recognize that there 
are a variety of approaches to acquiring the above data 
and storing them in the engine service database 30. For 
example, some data can be captured automatically using 
on-line data acquisition techniques, while other data can 
be captured using manually recording techniques or on- 



board data capture techniques. Furthermore, the engine 
service database 30 preferably stores the data in a format 
that permits users to import the data into other tools for 
further analysis, such as Microsoft EXCEL®, Minitab™, and 
SAS™ 

[0022] The engine service database 30 may comprise other types 
of data for the engines. For example, the engine service 
database 30 may comprise service information for the en- 
gines. Generally, the service information will comprise in- 
formation such as engine product information, age infor- 
mation of the engines and repair history of the engines 
(e.g., dates of service events, types of service events, etc.). 
Other types of engine data stored in the engine service 
database 30 may include in-flight data, engine utilization 
data (e.g., where, when, how flown), ownership data, re- 
mote monitoring and diagnostics status data. 

[0023] Referring to FIG. 2, the engine baseline modeling system 
28 comprises a preprocessor 32 that processes the engine 
data into a predetermined format. In particular, the pre- 
processor 32 comprises a data acquisition component 38 
that extracts the engine data from the engine service 
database 30. The data acquisition component 38 acquires 
the service data by using commercially available modules 



available from Minitab , Microsoft , Oracle , etc. which 
directly extract the data into the engine baseline modeling 
system 28, however, one of ordinary skill in the art will 
recognize that one can write specialized code to extract 
the data into a common format and write additional spe- 
cialized code to import that into the system. 
[0024] The preprocessor 32 performs computations that simplify 
future processing of the data, while a data scrubbing 
component 40 cleans the engine data. In particular, the 
preprocessor 32 performs operations that convert the 
data into standard units. For example, the preprocessor 
32 can convert temperature data from Celsius to Kelvin or 
correct engine power setting data by the engine bleed 
settings. Other types of corrections that the preprocessor 
32 may perform include converting pounds to kilograms, 
altitude to pressure, knots and altitude to mach number. 
One of ordinary skill in the art will recognize that the 
listed corrections are only illustrative of some possibilities 
and are not exhaustive of other possibilities. Furthermore, 
one of ordinary skill in the art will recognize that the pre- 
processor 32 can perform the corrections in any manner 
desired (e.g., Celsius to Rankine) and is not limited to the 
above order. Examples of cleaning operations performed 



by the data scrubbing component 40 include discarding 
data elements with missing values, correcting simple ty- 
pographical errors, discarding data elements with erro- 
neous values out of reasonable operating range, etc. One 
of ordinary skill in the art will recognize that the listed 
cleaning operations are only illustrative of some possibili- 
ties and are not exhaustive of other possibilities. 
[0025] | n addition, the preprocessor also comprises a data seg- 
menting component 42 that segments engine data into 
groups, nodes or clusters that represent similar operating 
conditions. The groups generally include engine perfor- 
mance variables such as power setting, altitude, air speed 
(mach number), and air temperature. One of ordinary skill 
in the art will recognize that other engine performance 
variables such as air humidity and control settings may be 
selected and that the disclosure should not be limited to 
these variables. Once the groups have been selected, then 
the data segmenting component 42 can segment the data 
into the particular group that it relates to. Once the data 
are segmented into the groups, then the data segmenting 
component 42 can use a cluster analysis to determine 
clusters of operating conditions. Alternatively, an engineer 
may assign bands of operations of interest for each of the 



variables. 

[0026] An engine baseline modeling component 34 builds an en- 
gine baseline model from the data processed by the pre- 
processor 32. In particular, the engine baseline model 
built by the engine baseline modeling component relates 
the selected performance variables as a function of engine 
operating conditions using the processed data. Engine 
operating conditions include engine, aircraft and environ- 
mental conditions. In this disclosure, the engine baseline 
model is built from a regression analysis. Generally 
speaking, a regression is the statistical science of deter- 
mining an equation from a finite number of points that 
provides values of Yfor a given X, i.e., Y=f(X). In this dis- 
closure, the equation to be determined can be expressed 
as: 

[0027] 

Y = f (altitude, temperature, power setting, air speed) (1) 

[0028] where altitude, temperature, power setting and air speed 
are the X variables. The engine baseline modeling compo- 
nent 34 performs a regression to determine the above 
equation for each of the selected engine performance 
variables (i.e., power setting, altitude, air speed, and air 
temperature) during specified times that the engine is op- 
erating. For instance, the engine modeling component 34 



can perform the regression on the data taken during the 
take-off, climb and cruise for any or all of the engine per- 
formance variables. One of ordinary skill in the art will 
recognize that more engine performance variables (air hu- 
midity, control settings, etc.) or less engine performance 
variables can be used in equation 1. In addition, one of 
ordinary skill in the art will recognize that different com- 
binations of engine performance variables can be used in 
equation 1. 

[0029] | n general, a regression fits a parametric equation to a set 
of data by solving for values of regression parameters 
such that the best fit to the data set is obtained. Multiple 
linear regression is a type of regression that solves the 
system of equations, minimizing the combined error. In 
this disclosure, the system of equations that the regres- 
sion solves can be as follows: 



[0030] 



y[l] = a*power 



setting[l] + b*altitude[l] + c*temperature[l] + 



d*airspeed[l] + ... + error[l] 



y[2] = a*power 



setting[2] + b*altitude[2] + c*temperature[2] + 



d* airspeed [2] + ... + error [2] 



y[n] = a*power 



setting[n] + b*altitude[n] + c*temperature[n] + 



d*airspeed[n] + ... + error[n] 



(2) 



[0031] wherein a, b, c, d are the regression parameters and 

power setting[l], altitude[l], temperature [I], airspeed[l], y[l] 
are observed events. Again, one of ordinary skill in the art 
will recognize that the system of equations can differ de- 
pending on the selection of engine performance variables. 

[0032] The resulting parameter estimates for a, b, c and d are 

representative of the new baseline model. Instead of using 
tables to develop the baseline model as was done in the 
past, there is now a simple equation that describes the 
baseline behavior of the engine, from which either the ta- 
bles may be generated, or the equation can be applied di- 
rectly. For example, a baseline model for the exhaust gas 
temperature (EGT) parameter might be as follows: 

[0033] 

EGT = 0.1 * power + 0.001 * altitude + 0.0 1* temperature + 0.05 * airspeed (3) 

[0034] | n this example, an engine that had power set to 100, alti- 
tude at 1000, temperature at 300, and air speed at 200, 
would result in a predicted EGT value of 24 degrees, but 
might have a measured EGT value of 14 degrees, which 
would indicate that the engine was 10 degrees below the 
predicted value. 

[0035] The engine baseline modeling component 34 also com- 
prises a metric component 44 that validates the engine 



baseline model. In particular, the metric component 44 
validates the engine baseline model by examining the 
quality of the built model. In this disclosure, the metric 
component 44 determines the goodness of model fit by 
analyzing statistical goodness of fit metrics, such as R- 
squared, which is a common regression tool output. One 
of ordinary skill in the art will recognize that the metric 
component 44 can determine other metrics besides the R- 
squared metric such as the mean square error, sum 
square error and sigma metrics, which are other common 
regression tool outputs. 

[0036] The engine baseline modeling component 34 also com- 
prises a data cleaning heuristics component 46 that 
cleans the preprocessed data according to a set of heuris- 
tics. Generally, the data cleaning heuristics component 46 
uses heuristics to remove data that does not conform to 
the norm. An illustrative, but non-exhaustive list of data 
that the data cleaning heuristics component 46 removes 
includes regression outliers, regression leverage points, 
and faulty engines. In this disclosure, this data cleaning 
operation can be performed for groups of engines or 
fleets of aircraft that use a common engine. 

[0037] Additional details and embodiments relating to the engine 



baseline modeling component 34 will be set forth in detail 
below. In particular, inventive techniques are provided 
which additionally enhance the quality and reliability of 
the generated model. 
[0038] FIG. 2 also shows that the engine baseline modeling sys- 
tem 28 comprises a model diagnostics component 36 that 
evaluates the performance of the engine baseline model. 
In particular, the model diagnostics component 36 gener- 
ates statistical outputs that provide statistical information 
to a user of the engine baseline modeling system 28. An 
illustrative, but non-exhaustive list of the statistical out- 
puts that the model diagnostics component 36 generates 

2 

includes variance, r , collinearity, probability plots, resid- 
ual plots, standard error measurements, confidence limits 
on the engine baseline model, prediction limits, pure error 
lack-of-fit test, data subsetting lack-of-fit test, multicol- 
inearity metrics (variance inflation factors), autocorrelation 
of residuals (Durbin-Watson statistic), etc. 
[0039] The algorithms performed by the components in the en- 
gine baseline modeling system 28 (i.e., the preprocessor 
32, engine baseline modeling component 34 and model 
diagnostics component 36 can be programmed with a 
commercially available statistical package such as SAS, but 



other languages such as C or Java may also be used. 

[0040] The engine baseline modeling system 28 is not limited to 
a software implementation. For instance, the preprocessor 
32, engine baseline modeling component 34 and model 
diagnostics component 36 may take the form of hardware 
or firmware or combinations of software, hardware, and 
firmware. In addition, the engine baseline modeling sys- 
tem 28 is not limited to the preprocessor 32, engine 
baseline modeling component 34 and model diagnostics 
component 36. One of ordinary skill in the art will recog- 
nize that the engine baseline modeling system 28 may 
have other components. 

[0041] FIG. 3 shows a flow chart describing actions performed by 
the engine baseline modeling system 28 shown in FIG. 2. 
At block 48, the data acquisition component 38 extracts 
the engine data from the engine service database 30. 
Next, at 50 the user of the engine baseline modeling sys- 
tem 28 selects a particular engine model and engine per- 
formance variables for that engine that he or she would 
like to model (e.g., power setting, altitude, air speed and 
air temperature). The preprocessor 32 converts data into a 
standardized format at 52 and the data scrubbing compo- 
nent 40 cleans the engine data at 54. The data segment- 



ing component 42 then segments the engine data into 
groups such as altitude, air speed and air temperature, 
fuel specific heat value, air humidity, control settings, etc. 
at 56. 

[0042] After the data segmenting component 42 has segmented 
the engine data into groups, the engine baseline modeling 
component 34 builds an engine baseline model from the 
data processed by the preprocessor 32. In particular, the 
building of the engine baseline model begins with the en- 
gine baseline modeling component 34 performing a re- 
gression to determine parameters for each of the selected 
engine performance variables (e.g., altitude, air speed and 
air temperature) at 60. As mentioned above, the regres- 
sion relates the engine performance variables as a func- 
tion of engine operating conditions. The metric compo- 
nent 44 then determines the residuals of the regression at 
62 and applies the metrics (e.g., R-square, mean square 
error, sum square error and sigma metrics) at 64. The 
term residuals refers to the differences between the actual 
values of the dependent variables and their predicted or 
estimated value for a particular observation. Next, the 
data cleaning heuristics component 46 cleans the prepro- 
cessed data according to a set of heuristics and generates 



certain statistics such as outliers and leverage points at 
66. 

[0043] The OLE_LINKlengine baseline modeling component 34 
OLE_LINKlthen performs another regression at 68. The 
engine baseline modeling component 34 applies a second 
regression to improve the parameter estimates by using a 
cleaner data set. Again, the metric component 44 deter- 
mines additional residuals at 70. Alternatively, the metric 
component 44 can generate plots of the residuals if a user 
desires. At 72, the engine baseline modeling component 
determines whether there are any more segments that 
have to be analyzed. If there are more segments, then the 
next segment is analyzed at 74 and blocks 60-72 are re- 
peated. This process continues until it is determined at 72 
that there are no more segments. Once it has been deter- 
mined that there are no more segments, then the model 
diagnostics component 36 evaluates the performance of 
the of the engine baseline model at 76 and generates cer- 
tain statistical outputs that relate to the model. 

[0044] Referring now to FIG. 4, there is shown a flow chart de- 
scribing one alternative embodiment for actions per- 
formed by the engine baseline modeling system 28 be- 
ginning at block 56 of FIG. 3. Contrary to the embodiment 



of FIG. 3 wherein a baseline model is computed using data 
for a fleet of engines over a long period of time, the 
present embodiment performs a similar analysis for indi- 
vidual engines. Changes to the calculated model are then 
used to determine the health or condition of the individual 
engine. More particularly, a step of segmenting engine 
data 100 is now characterized by grouping engine data by 
each specific engine and further including information re- 
garding the time at which each measurement was 
recorded. Furthermore, inclusive in each engines data 
grouping could also alternatively include subgroupings or 
segments for any desired length of time. For example, en- 
gine data for a given engine could span across three years 
time. In accordance with the present embodiment, an en- 
gine baseline model could initially be calculated for this 
entire span. Additionally, subgroupings of engine data 
measurements based upon specific time intervals could 
also be used to generate baseline models. In one embodi- 
ment, baseline models for each distinct 60 day period 
could be calculated (e.g., days 1-60, days 61-120, etc.), 
resulting in 18 distinct baseline models. Alternatively, 
baseline models for each 60 day period within the overall 
span could be calculated (e.g., days 1-60, days 2-61, 



etc.), resulting in 1000 distinct baseline models. 

[0045] Once engine segments have been identified, the system 
proceeds to block 102 where the engine baseline model- 
ing component 34 builds an engine baseline model by 
performing a regression to determine parameters for each 
of the selected engine performance variables (e.g., alti- 
tude, air speed and air temperature) for a first data seg- 
ment (i.e., grouped engine data for a selected time pe- 
riod). The metric component 44 then determines the 
residuals of the regression at block 104 and applies the 
metrics at block 106. Next, the data cleaning heuristics 
component 46 cleans the preprocessed data according to 
a set of heuristics and generates certain statistics such as 
outliers and leverage points at block 108. 

[0046] The engine baseline modeling component 34 then per- 
forms another regression at block 110. Again, the metric 
component 44 determines additional residuals at block 
112. At block 114, the engine baseline modeling compo- 
nent determines whether there are any more segments 
(e.g., time periods) that have to be analyzed. If there are 
more segments, then the next segment is analyzed at 
block 116 and blocks 102-114 are repeated. This process 
continues until it is determined at block 114 that there are 



no more segments. Once it has been determined that 
there are no more segments, a time series for each of the 
estimated parameters can be generated from the combi- 
nation of models or estimates, (fan speed coefficient, air 
temperature coefficient, etc) at block 118. This results in a 
more accurate picture of engine state changes during the 
time under measurement. Based upon the time series of 
each estimated parameter generated in block 118, a trend 
for each parameter is calculated in block 120. 
[0047] | t has been determined that when faults occur in the sys- 
tem, the systerrTs baseline trend will often shift. Unfortu- 
nately, the shift only alerts you to a problem without 
pointing to a specific cause. As one example, consider a 
fuel economy trend: if vehicle fuel economy changes, one 
would expect to find a fault somewhere in the system. 
However, under the embodiment of FIG. 3, significant in- 
formation regarding what the actual fault was would not 
present itself. 

[0048] Referring now to FIG. 5, there is shown a flow chart illus- 
trating one embodiment of a method for diagnosing sys- 
tem faults utilizing the baseline models and engine pa- 
rameter trends calculated in FIG. 4. Initially, in block 122, 
the baseline model generated in FIG. 4 is monitored for 



trend fluctuations representative of an engine fault. In 
block 124, if a potential fault occurrence is identified, the 
parameter estimate trends associated with the model are 
evaluated. Because each of the parameter estimate trends 
calculated in block 120 basically remain constant for a 
stable system, one would expect one or more of these pa- 
rameter estimate trends to shift when a corresponding 
engine baseline trend shift occurs. In block 126, one or 
more of the parameter estimate trends relating to the 
baseline model trend shift is identified as well its manner 
of fluctuation. Such parameter estimate trend shifts help 
narrow down the possible system faults. 

[0049] Returning to the fuel economy example initially set forth 
above, if a baseline shift indicates a problem, one may 
identify that parameter estimate A has shifted downwards 
and parameter estimate B has shifted upwards. Con- 
versely, one might see A and B remaining constant with a 
large shift in parameter C. This fault signature could be 
used to identify case 1 as a fuel system problem and case 
2 as an air system problem, reducing the effort needed to 
diagnose the fault. 

[0050] | n essence, by calculating a variety of engine baseline 

models utilizing moving time windows, estimates for each 



parameter measured may be accurately trended, thereby 
synthesizing additional information useful for diagnosing 
the cause of the fault. 
[0051] it should be understood that a variety of methods may be 
utilized in accordance with the present invention to iden- 
tify faults reflective of the relationships between baseline 
model trend shifts and parameter estimate changes. In 
one embodiment, conditional probabilities may be utilized 
to identify such faults. In such an embodiment, the proba- 
bilities for each potential fault reflective of the trends/ 
parameter estimate information obtained are determined. 
In the example set forth above, probabilities each poten- 
tial fault (e.g., faulty fuel system, faulty air system etc.) 
given the changes identified in each parameter estimate 
A, B, and C would be calculated. The results of these 
probability calculations would then assist in identifying 
likely faults. 

[0052] | n a second embodiment, a case-based reasoning system 
may be employed to identify likely faults. Using this 
method, a set of "cases" are established correlating to 
various potential faults. Typically, these cases would be 
based upon measurements made during known fault oc- 
currences. Once the cases have been established, changes 



in each parameter estimate are then compared to the 
cases and a closest case is identified as a likely fault. 
[0053] | n some circumstances, it has been determined that the 

parameters estimated relate to highly correlated variables. 
That is, changes in one parameter precipitate changes in 
another parameter (for example, air temperature drops 
when altitude increases). This correlation causes the pa- 
rameter estimates to be unstable. Accordingly, an alterna- 
tive embodiment of the present invention refines the 
above-described process, wherein the principle compo- 
nent analysis technique is utilized to map the initial en- 
gine data to an uncorrelated dataset. In a preferred em- 
bodiment, engine data relating to the parameter of inter- 
est are not mapped. This uncorrelated dataset is then 
used in the manner set forth above to compute the pa- 
rameter estimate trends. These trends can then be used 
as before to diagnose the faults using a decision making 
technique such as an expert system, case-based reason- 
ing, etc. 

[0054] Principal component analysis involves a mathematical 

procedure that transforms a number of (possibly) corre- 
lated variables into a smaller number of uncorrelated vari- 
ables which are called principal components. Of the iden- 



tified components, the first accounts for as much of the 
variability in the data as possible, while each succeeding 
component accounts for as much of the remaining vari- 
ability as possible. Referring now to FIGS. 8-10, there is 
shown a series of graphs illustrating a correlated data sets 
and how the principal component analysis embodiment of 
the present invention overcome correlation instability. 

[0055] FIG. 7 is one example of a graph of raw exhaust gas tem- 
perature data over time (It should be understood that ex- 
haust gas temperature is just one of a variety of engine 
performance variables which may apply to the given em- 
bodiment.) Once the time-adjusted regression analysis 
has been performed, a clear baseline shift with respect to 
the exhaust gas temperature variable is demonstrated in 
FIG. 8. However, because changes in various ones of the 
measured parameters necessarily affect other parameters 
through a naturally occurring correlation, these correla- 
tions must be removed to generate stable estimates. 

[0056] Accordingly, a principal component analysis is performed 
in which initial engine data is mapped by the preprocess- 
ing component to uncorrelated datasets using the princi- 
pal component analysis technique. FIG. 9 illustrates a 
graph of one resulting data set with new principal compo- 



nents pi, p2 and p3. 

[0057] The foregoing flow charts of FIGS. 3-5 of this disclosure 
shows the functionality and operation of the engine base- 
line modeling system 28. In this regard, each block repre- 
sents a module, segment, or portion of code, which com- 
prises one or more executable instructions for imple- 
menting the specified logical function(s). It should also be 
noted that in some alternative implementations, the func- 
tions noted in the blocks may occur out of the order noted 
in the figures or, for example, may in fact be executed 
substantially concurrently or in the reverse order, de- 
pending upon the functionality involved. Furthermore, the 
functions can be implemented in programming languages 
such as C and Java, however, other computer program- 
ming languages can be used. Also, the engine service 
database 30 may be constructed using commercial 
databases including standard relational, object-oriented 
or hierarchical databases. Data manipulation operations, 
including retrieval, creation, modification, and deletion, 
can be implemented within the programming languages 
or within the database using stored procedures or triggers 
or combinations. 

[0058] The above-described engine baseline modeling system 28 



comprises an ordered listing of executable instructions for 
implementing logical functions. The ordered listing can be 
embodied in any computer-readable medium for use by 
or in connection with a computer-based system that can 
retrieve the instructions and execute them. In the context 
of this application, the computer-readable medium can be 
any means that can contain, store, communicate, propa- 
gate, transmit or transport the instructions. The computer 
readable medium can be an electronic, a magnetic, an op- 
tical, an electromagnetic, or an infrared system, appara- 
tus, or device. An illustrative, but non-exhaustive list of 
computer-readable mediums can include an electrical 
connection (electronic) having one or more wires, a 
portable computer diskette (magnetic), a random access 
memory (RAM) (magnetic), a read-only memory (ROM) 
(magnetic), an erasable programmable read-only memory 
(EPROM or Flash memory) (magnetic), an optical fiber 
(optical), and a portable compact disc read-only memory 
(CDROM) (optical). 
[0059] N 0 te that the computer readable medium may comprise 
paper or another suitable medium upon which the in- 
structions are printed. For instance, the instructions can 
be electronically captured via optical scanning of the pa- 



per or other medium, then compiled, interpreted or other- 
wise processed in a suitable manner if necessary, and 
then stored in a computer memory. 
[0060] FIG. 6 shows an architectural diagram of a system 78 for 
implementing the engine baseline modeling system 28 
shown in FIG. 2 in a networked environment. In FIG. 6, a 
user uses a computing unit 80 to access the engine base- 
line modeling system 28 and engine service database 30. 
More specifically, the computing unit 80 connects to the 
engine baseline modeling system 28 and engine service 
database 30 through a communication network 82 such 
as an electronic or wireless network. The computing unit 
80 can take the form of a hand-held digital computer, 
personal digital assistant computer, notebook computer, 
personal computer or workstation, while the communica- 
tions network may be a private network such as an ex- 
tranet or intranet or a global network such as a WAN (e.g., 
Internet). A web server 84 serves the engine baseline 
modeling system 28 and the engine service database 30 
to the user in the form of web pages. The web pages can 
be in the form of HTML, however, other formats and 
structures can be used such as SGML, XML or XHTML. The 
user uses a web browser 86 running on the computing 



unit 80 such as Microsoft INTERNET EXPLORER, Netscape 
NAVIGATOR or Mosaic to locate and display the web pages 
generated from the engine baseline modeling system 28 
and engine service database 30. 

[0061] if desired, the system 78 may have functionality that en- 
ables authentication and access control of users accessing 
the web pages linked to the engine baseline modeling 
system 28. Both authentication and access control can be 
handled at the web server level by a commercially avail- 
able package such as Netegrity SITEMINDER. Information 
to enable authentication and access control such as the 
user names, location, telephone number, organization, lo- 
gin identification, password, access privileges to certain 
resources, physical devices in the network, services avail- 
able to physical devices, etc. can be retained in a database 
directory. The database directory can take the form of a 
lightweight directory access protocol (LDAP) database, 
however, other directory type databases with other types 
of schema may be used including relational databases, 
object-oriented databases, flat files, or other data man- 
agement systems. 

[0062] | t j S apparent that there has been provided in accordance 
with this invention, a baseline modeling system, method 



and computer product. While the foregoing description 
includes many details and specificities, it is to be under- 
stood that these have been included for purposes of ex- 
planation only, and are not to be interpreted as limitations 
of the present invention. Many modifications to the em- 
bodiments described above can be made without depart- 
ing from the spirit and scope of the invention, as is in- 
tended to be encompassed by the following claims and 
their legal equivalents. 



